Modeling First Line Of An Order Book With 
Multivariate Marked Point Processes 



Alexis Fauth 1,2 Ciprian A. Tudor 3 ' 4 * 

1 SAMM, Universite Paris 1 Pantheon-Sorbonne, Paris, France. 
2 Invivoo, Courbevoie, France, 
alexis . fauth@invivoo. com 
3 Laboratoire Paul Painleve, Universite de Lille 1, Villeneuve d'Ascq, France. 
4 Department of Mathematics, Academy of Economical Studies, Bucharest, Romania. 

tudor@math.univ-lillel.fr 

November 20, 2012 



Abstract 

We introduce a new model in order to describe the fluctuation of 
tick-by-tick financial time series. Our model, based on marked point 
process, allows us to incorporate in a unique process the duration of 
the transaction and the corresponding volume of orders. 

The model is motivated by the fact that the "excitation" of the 
market is different in periods of time with low exchanged volume and 
high volume exchanged. We illustrate our result by numerical simula- 
tions on foreign exchange data sampling in millisecond. By checking 
the main stylized facts, we show that the model is consistent with 
the empirical data. We also find an interesting relation between the 
distribution of the volume of limited order and the volume of market 
orders. To conclude, we propose an application to risk management 
and we introduce a forecast procedure. 
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1 Introduction 



Buy and sell orders did not arrive at continuous times, counterparts do not 
meet them at any time, hence, the fluctuations of the stock market cannot 
evolve continuously and high frequency financial data have a flagrant dis- 
continuity The idea to use point processes to describe such irregularities 
appears as an evidence. 

The family of stochastic processes that we will use in our analysis, the 
so-called Hawkes processes, was introduced in the seventies by A. Hawkes in 
the paper [Ha71a]. The initial purpose was to model earthquake occurrences. 
A Hawkes process is a generalization of a standard point process. They al- 
ready found applications in various fields. We refer, among others, to Ogata 
[Og98, Og06] for statistical earthquake modeling, to Chavez-Demoulin et al. 
[ChHa09] and Errais et al. [EiTO] for risk analysis, Ait-Sahalia et al. [All] for 
the so-called financial contagion, to Bacry et al. [Balla] for high-frequency 
financial data and to Muni Tokes [TolO] for order book modeling. 

The scientific literature related to high frequency data analysis usually 
proposes to use self-exciting point processes that naturally correspond to the 
frequency of the arrival times of orders and do not take into account the 
corresponding volumes or the current return. Generally, in these models, at 
the same frequency, 'small' or 'large' volumes traded will impact the market 
in the same way. 

The literature on stylized facts for high frequency data is very rich. We 
recall the following properties frequently observed on the data: fat tails of 
financial returns, long memory on square returns, the multifractal nature, 
etc. For extensive reviews, see e.g. Dacorogna et al. [DaOl], Engle [EnOO], 
Bouchaud and Potters [BoPo04] or Voit [Vo05]. In the case of the very high 
frequency, which means transactions recorded tick by tick, the most common 
properties are the so-called Epps effect (see Epps [Ep79]), the signature plot 
(see Andersen et al. [AnOO]), the intertrade duration (see Hautsch, [Hal 2]), 
and the market impact (see Moro et al. [Mo09]). 

On the market, the price changes "tick by tick". The tick size is the 
minimum of the variation of the price. The price of an asset is written 
as p = np t i C k where n G N* and p t kk is the tick size. For example, it is 
between 10~ 4 and 0.5 x 10 _5 $ for the Forex market and usually 0.25$ for 
future contracts. There are two principal types of orders which can move the 



2 



price: the market order and the limited order. 

A market order will immediately be executed at the best available price 
and at the relevant time. Of course, a buy market order will be executed to 
the best ask price, while sell market order will be executed to the best bid 
price. The best ask (bid) price is understood at the lowest price at which 
an agent can buy (sell) an asset. The volume offered corresponding to best 
ask price need to be completely bought to see the best ask (bid) price move 
upward (downard) from one tick. 

A limited order is an order to buy (sell) an asset to a specific price, lower 
(higher) than the best ask (bid) price. Then, a new offered volume inside 
the bid-ask spread will move the price downward or upward depending if it 
is proposed on ask side or bid side. 

A market order will decrease the amount of shares available on the mar- 
ket, the volume, while a limited order inside the bid-ask spread will increase 
them. We schematize these two types of orders that may be vary the price 
in the figure (1). Thus, it is the exchange volume which affects the price and 
the financial fluctuation. During the excitation phase, i.e. when the price 
moves quickly, the market volume is quickly exchanged and hence, the prices 
vary rapidly. The time transaction and the volume are strongly dependent 
since one produces the other one. Several papers related to the theory of 
market impact try to deals with this aspect, see e.g. E. Moro et al. [Mo09], 
or Z. Eisler et al. [EiBoKolO]. Our purpose is to incorporate this information 
concerning the variable "volume of orders" in our model. To do this, we will 
employ the marked Hawkes process, their mark corresponding to the volume 
of each transaction. 

In order to reproduce the fluctuation of the price, we have to take into 
account the mechanism of the formation of the price. Consider for example a 
limited order book. Hence, we can observe the market order and the limited 
order. We have conscience of the existence of canceled orders too, but, since 
we do not have this information only looking to the order book, we cannot 
take it into account in our framework. By analyzing some SEC (Securities 
and Exchange Commission) reports or their counterparts in France, the AMF 
(Autorite des Marches Financiers) reports, we can notice that about 80% of 
orders are canceled. As this amount is not tangible, except by the regula- 
tory authorities, we choose to avoid that quantity. Then, the appearance of 
these different types of orders (marker orders and limited orders) produces 
the price. If we are able to reproduce this scheme, we should be able to 
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Figure 1: Schematic representation of an Order Book. In red volume pro- 
posed for sell, in blue volume proposed for buy. Left: initial state. Middle: 
same order book after a buy market order which move upward the best ask 
form one tick. Right: a new buy limited order inside the bid-ask spread 
which move upward the best bid. 

reconstruct the fluctuation of the price. 

The aim of this study is to build a model with two approaches, take into 
account in a certain way the limited order book but focusing on the best bid 
and the best ask prices. The mathematical tool used is represented by the 
marked Hawkes process. To test how realistic is our model, we will check the 
different stylized facts. In particular, to reproduce the Epps effect, that is, 
the correlation between two assets at different frequencies, the model will be 
presented in a multivariate form. 

First, we recall some existing models related to our problem and we will 
explain why we choose our approach. After, we recall the definition and the 
basic properties of the multivariate marked Hawkes processes and using them, 
we will introduce our model in d dimensions. We conclude by comparing the 
empirical data and the stylized facts to our model and we give some ideas 
on how to apply this result to trading and risk management. All numerical 
computations are done on foreign exchange market. 
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2 Literature Review 



We have a very interesting literature on modeling very high frequency data 
using point processes, and in particular Hawkes processes. Basically, we have 
two main approaches. 

Let us recall the main ideas of these two approaches. In the first one, 
one tries to mode the data using only the tick information, see e.g. [Balla], 
which is the frequency of order arrival. Such models have the following form 

p(t) = N+(t)-N-(t), (2.1) 

where pit) is the asset price and N ± (t) is a counting process corresponding to 
the number of upward and downward jumps. The "asset price" is considered 
as the last price, or the mid-price. The mid-price is the mean between ask 
and bid price, (p as k+Pbid)/2- Unfortunately, in that case, we do not have any 
information about the bid and ask prices. The spread, which is the difference 
between bid price and ask price is a well known cost by practitioner. The 
best bid price is the highest price at which an impatient trader sell his asset, 
in other words it is the market price to sell-short an asset. In a similar way, 
the best ask price is the lowest price at which an impatient trader can buy 
an asset. Hence, if the agent buys at market kp as k = x$ k shares of an asset, 
where k is the amount of contracts bought and then needs to sell just after, 
the agent has to sell at bid price which is, assuming no fluctuation happened 
during that laps time, kpud = (x§ — s)k where s is the bid-ask spread, 

S = Pask - Pbid- 

Consequently, the transaction cost is ks$. In order to optimize his trading 
activity, any agent has to take into account this quantity; of course, we have 
to add other inherent costs such as broker fees and the market impact. Re- 
call that market impact is the effect that buying or selling moves the price 
against the trader: upward when buying, downward when selling. 

The second approach used in the literature is to try to model the lim- 
ited order book, which is somehow more complicated. These models, to be 
accurate and consistent to the problem, requires to take into account other 
aspects and not only the transaction price and the trade duration. In [La07], 
J. Large try to work with ten types of orders: 

1. Market buy that moves the ask price 
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2. 


Market sale that moves the bid price 


3. 


Bid between the quotes 




4. 


Ask between the quotes 




c 
0. 


Market buy that doesn't move 


the ask price 


6. 


Market sale that doesn't move 


the bid price 


7. 


Bid at or below bid price 




8. 


Ask at or above ask price 




9. 


Cancelled bids 




10. 


Cancelled asks 





Notice that the aim of the above reference is to measure the resiliency of an 
limited order book. Resiliency is the ability to a market to come back to 
his equilibrium after a large aggressive order. Any of this type of order is 
modelized by a counting process iVj with intensity Aj, i = 1, ■ ■ ■ , 10. 

I. M. Toke, [TolO] simplifies the formulation by proposing to describe a 
limited order book by a two-agents model. One patient trader who submits 
only limits orders who sometimes cancels his own orders. The second agent 
is impatient and submits only market orders. In both cases, what is really 
interesting, is that we have a corresponding volume of orders drawn from an 
exponential law. The cancelation of an order arrives with a certain proba- 
bility 5. Other quantities are drawn from a Hawkes process. 

In both cases, it become complicated and time consuming to extend such 
models in a 2-dimension way and to include the lead-lag effect or the Epps 
effect. 

3 Multivariate Marked Hawkes Process 

Let N(t) a <i-dimensional point process, iV = (Ni, N%, ■ ■ ■ , N^), with Ni(ti), 
1 < i < d the cumulative number of events for the i th component a time t,. 
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Note that this process iVj(tj), % = l,..,d must be non-decreasing and right 
continuous, and take only non-negative integer values. 

All along this paper, we will work on a standard probability space (Q, J 7 , P) 
and on this space we will consider N(t) a d- dimensional J-^-adapted process, 
t — (ti, • - • , td). To simplify notations, we will often use the notation t 
instead of t{ even for point in M. 

An important characteristic of the process N is its conditional intensity 
with respect to the filtration considered and this conditional intensity is given 



The intensity characterizes the evolution of the process N(t) with respect 
to this past history jF t . We could interpret that as the conditional probability 
at time t to observe a new event at next time t + St. 

The Hawkes process is a particular class of the point processes. The first 
definition of the Hawkes process, which is quite specific, has been given in 
[Ha71a]. Today, one calls a larger class of point processes as Hawkes pro- 
cesses. This class is characterized by the intensity function. More precisely, 
the intensity of of multivariate ci-dimensional Hawkes process is usually de- 
fined by 



where h = {/ii}»=i is the so-called decay kernel and satisfies hi(t) > for 
all i. The parameters {yij}i,j=i,- ,<z are referred as the branching coefficient, 
it quantifies the ability of an event i to trigger an event of type j. Thus, we 
see the mutually exciting structure of the process since event j ^ i can affect 
the conditional intensity Aj. Self-exciting part is of course the case j = i, 
past and current event will induce a response in their own intensity process 
and therefore, on the corresponding point process. The constants { / u i } i= i i ... ^ 
are understood as the rate of instantaneous events. 

The branching matrix is the average number of event j triggered by an 
event of type i. Consider the case d — 2 and the associated branching matrix 



by 



\(t\F t ) = lim — E [N{t + St) - N(t)\F t ] . 



(3.1) 




(3.2) 




(3.3) 
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In this case ui 2 is interpreted as the average number of event of type i = 1 
with parent event type j = 2. The integral (3.2) of the kernel with respect 
to the counting process N can be expressed as 



— oo 



hi(t-s)N{ds) = Ht-tk), 



k\t<t k 



(3.4) 



where t is the physical time and {tk}k=i,--- ,n represents the arrival times of 
events. If a new event occurs the intensity decreases. This phenomena is 
described by the decay kernel h. A natural and standard choice of the 
function h is, 



for every i — 1, .., d. If we observe the arrival of a new event, then the decay 
function will be close to a« > and if no new event occurs, then the decay 
function will tend towards exponentially. 

The purpose of this work is to incorporate the volume of the orders in our 
model. As mentioned before, we will use multivariate Hawkes process but 
we will incorporate the marks in the intensity expression (3.2). A mark is 
an additional value attached to each point and brings some new information 
about the points. Consequently, we will have marked intensities, denoted in 
the sequel X(t, Vt\J-t) where Vt represents the mark and will model the volume 
of the orders at time t. The marked intensity (see e.g. Daley and D. Vere- 
Jones 03, [DaVe03] and Liniger 09, [Li09] for an extensive review) takes the 
form, 



Now, Tt is the history of arrival time and corresponding mark, {ti,v(ti)}. 
The function gj, j = 1, • • • , d is the so-called impact function of marks, in 
other words, it characterizes the impact of the volume on the financial asset 
fluctuation. It has to satisfy the condition f Rd g{y)f{v)dv = 1 where / is 
the density of the vector V — (Vy, ■ ■ ■ , Vd), V% = (fi(l), • • • , Vi(t), ■ ■ • , Vi(T)). 
Most common choices for the impact function are, 



hi(t - t k ) = oti exp(-«i(t - t fc )), ati > 



(3.5) 




(3.6) 
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g(x) = x a , a > 0, and g(x) = exp(aa;), a > 0. (3.7) 

Naturally, the first one corresponds to an impact in power law, and the 
second one means an exponential impact of the volume on the asset price. 
We used the notation g because we need to normalize the function in order to 
respect condition j Rd g(v)f(v)dv = 1. This normalization condition is mainly 
made for a comprehensive way than anything else. Using that representation 
implies that branching coefficient appears explicitly (see [Li09]), and then 
we could interpret the different relation and deals simply with condition on 
branching matrix eigenvalues. Then the corresponding normalized versions 
of (3.7) are: 

x a exp(cta;) 
9{x) = a > 0, and g[x) = ^ x , a > (3.8) 

where X is a random variable with density /. 

Without the impact function, or in other words without taking into ac- 
count the mark, the intensity's increase depends only on the time duration 
between two events. Using marked Hawkes process, the intensity increases 
with respect to the arrival time of events but also with respect to the mark 
value. So intensity, and finally the price, will be influenced by the trading 
volume. This is the fundamental reason why we have chosen to work with 
this process. Indeed, the market impact depends on the degree of excitation, 
but not only on the frequency of the arrival of significant events. It is also 
affected by the amount of the exchanges at each moment. It it then natural 
to use marked point process instead of unmarked process. 

The kernel hi(t — s), i = 1,2, ...,d gives us the probability that at time 
s < t, an event of type j to trigger an event j at time t. Notice that as for 
the non marked and univariate case, we simply have, 

d 

\ i (t,v\F t ) = f i i + J2"ij E Ht-t^) 9j (v(t^)). (3.9) 
j=1 k\t[ j) <t 

The multivariate marked Hawkes process is well defined and stationary 
if in (3.6) we have the spectral radius of the branching coefficients matrix, 
denoted v, less than 1: 

max |Aj| < 1, i — 1, • • • , n (3.10) 
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where Ai, • • • , A n are the eigenvalues of the branching matrix. Also, the decay 
function verifies: 

/■oo 

/ thi(t)dt < oo. (3.11) 
Jo 

The first condition gives, after numerical parameters estimation, a first re- 
sponse if the Hawkes process defined above fits with the practical situation 
or not. Notice that the coefficients of the matrix v = {yij}i,j=i, - 4 are them- 
selves issued of the estimation. This first response is a necessary but not 
a sufficient condition to validate the model. We will discuss further in the 
section devoted to the estimation procedure. The second condition implies 
some restriction in the choice of decay kernel function h. 

Notice that to verify quality simulation, the Hawkes process provides 
by their nature two goodnest-of-fit tests. First one is that the integrated 
intensity between two consecutive events follows an exponential law of rate 
1, 

j \{t,v\JF t )&t xdu~Exp(l). (3.12) 

The second one is the called random time changed theorem (see Proposition 
7.4.VI (b) in [DaVe03]), if / °° Xj(t, v t \T t ) = 00 for all j = 1,--- ,d, then, 
under the random time transformation, 

(t,v)^(A(t,v),v), (3.13) 

the marked point process is transformed into a compound Poisson process N 
with unit ground rate and stationary mark distribution /. 



4 Modeling high-frequency data 

When someone tries to model an order book, he, she has to be aware that 
he models only one order book corresponding to one broker. Each broker 
has his own order book, his own agents market participants. Usually, thanks 
to arbitragers and market maker, in any case, we have the same last price. 
That kind of arbitrage opportunity, when the bid and ask prices differ from 
one broker to another, is quickly detect. On Forex market, due to the mul- 
tiplication of market places, looking to the action of only one broker has no 
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really sense, that's why we commonly have just best bid and best ask prices 
with corresponding volume. That's why we focus only on these two price 
processes. 

Let us recall that the two major aims of our study are to give a model 
able to reproduce financial fluctuations and to take into account the bid-ask 
spread. To have an idea on the bid-ask spread, we think that the most simple 
and realistic model is to diffuse conjointly bid and ask prices, and thus, to 
deduce the bid-ask spread. We have to take into account the dependence 
between the two kind of prices and of course, this two quantities are usually 
strong correlated. 

Also, as we mentioned before, an important volume of trading does not 
affect market in same manner as a small volume does. One of the most pop- 
ular stylized facts, the so-called cluster of return, implies that large changes 
are usually followed by large changes. It is therefore natural to think that a 
large exchanged volume could trigger many trades, exciting the market. We 
have plotted on (2) the log-return of EUR/USD in "deci" -seconds sampling 
to see these phenomena. 




Figure 2: EUR/USD log-return from 14:00:00 to 14:10:00, 06.02.2012 in de- 
cisecond sampling. 
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To confirm this idea about the impact of the trading volume on the price 
we present In Figure (3) the plot of the time interval between the change 
of volume and the corresponding volume variation. Data are EUR/USD 
and EUR/GBP form 30-01-2012 to 10-03-2012 recorded in milliseconds. We 
clearly see on that plot that a big volume variation corresponds to a short 
variation. The intertrade time interval (or simply duration), is then defined 
as the laps time between two consecutive orders, 

d k = t k -t k -i (4.1) 




i a » 3) i a t « a s « » s 



Figure 3: Volume variation with respect to duration, second scale. In blue 
(red), the change in ask (bid) volume. Left: EUR/USD, right: EUR/GBP. 
The variations are given in Million of USD. Notice that the duration near to 
1 minute correspond to the price recorded around midnight GMT. 

Let us now explain our model. Writing a for best ask price, b for best 
bid price, + for upward jump and - for downward jump, our model takes the 
form: 
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Ask: 



Bid: 



A a ,+(0 = /V+ + u *,+>i>+ h aj+ (t - s)g j (v)N j (ds x dv) 

j=a+,b+ J(-°o,t)xR 
j=o-,6- ^(-oo,t)xR 

Aft,+(t) = /4>,+ + 5^ Vb,+,j,+ h bt+ (t - s)gj(v)Nj(ds x du) 

i=a+,6+ ^(-oo,t)xM 

A 6 -(0 = a*6,- + 5^ "b-,j- h b -(t-s)g j (v)N j (dsxdv). 



(4.2) 

The above model is univariate, corresponding to only one agent. Let us 
explain the above notation: the quantity A aj+ represents the intensity asso- 
ciated with the ask price when this price had an upward move. Similarly are 
interpreted the intensities A a _,Ab i+ and A^-. The scalar z/ a+jfe + quantifies 
the interaction between the ask price and the bid price (in the upward jump 
case) . 

We have made here some choices on the interactions and the dependence 
between the agents. First, there is no interaction between upward and down- 
ward jump, so in the above notation = = 0, for all i,j = a,b. 
The dependence between upward jump on bid side and ask side is included 
in the model and the same holds for the downward jump. Let us explain 
that choices. Since we can naturally think that it is the bid side (respec- 
tively the ask side) which governed the decrease (respectively increase) of 
the price process, therefore we don't have any process corresponding to the 
dependence between + and — . Since it is empirically not possible to observe 
a bid-ask spread tending to infinity we introduced a dependence between up- 
ward (downward) jump for the bid and ask prices. Even if they do not evolve 
in a symmetric way (otherwise the spread will be constant) one follows the 
other. The case j — % is then the self-exciting part of the intensity while 
j 7^ i is the mutually exciting part. 

In this modeling, we deal only with trades that triggers a shift on price. 
Market order which just pick a part of the offered volume and limited order 
bigger than the best ask price or lower than the best bid price are not take 
into account. Indeed, conditional intensity corresponding to best ask price 
which produce an upward jump correspond to buy market orders, conversely, 
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the best bid price which moves the price downward is a sell market order 
which completly take the liquidity available on the corresponding tick line. 
Hence, Since that two quantities attempt to modelise market order, the case 
of market order who don't move the price is implicitely took into a count 
by the fact that no event occurs. On the other side, ask (bid) price which 
trigger an downward (upward) jump is a new limited order inside the bid- 
ask spread. Then, for the same reasons as before, limited orders outside the 
bid-ask spread are implicitly took into account when no events occurs. 

In one hand, remind that the conditional intensity is the probability to 
observe a new event knowing the history. On the other hand, like intensi- 
ties are marked, in other word, they depend not only on the frequency of 
arrival orders, but also of the corresponding volume. Then, more exhanged 
volume will be important, more intensity value will be important, and then, 
more the probability of a new occurence, a new price ffuctutation, will in- 
crease. Thus, we are consistent with fondamental observation like (2) or (3). 

Since the intensities describe completely the marked point process, know- 
ing Xj permit to simulate a sequence of marked point process {£;, v (ti)}i=i,... ,t 
(see e.g. [Og81], or [Li09]) and then, to deduce the counting process N(t). 
Denoting by p a and p b respectively ask and bid prices, we will have 

p a {t) = p(0) + (N ai+ (t) - iV 0i _(t))p tick 
p b (t) = p(0) + (N b>+ (t) - N bj _(t))p tick 

where p t ick is the tick size, 10~ 5 for our parities in our data base, and 
N a!+ constitutes a counting process with intensity A ai+ and similarly for 

Na,-,N b+ ,N b _. 

We have two distinct formulations for the different counting processes: 
• Consider iV as a simple counting process, which is 

n 

N(t) = J2Mt i <tXt i )=*} (4-4) 
i=i 

Assuming that at each new event, there is a corresponding mark, in- 
deed, this the case here because a new price fluctuation is necessarly 
associated to a new quantity of volume added or removed. Then, the 
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counting process take the same value than in a non marked case , ex- 
cept that the frequency will strongly affected by value of volume. Also, 
using a multivariate point process we have not only a bid and ask price, 
but also their corresponding volume, v a> ± and v^. 

• The marked case is an appropriate way to produce compound counting 
processes. Denote 

N{t) n 

N g (t) = Y,v(U), N(t) =X)l ti <«, (4.5) 
i=i i=i 

where g stands for a growth process. Then, if we keep formulation 
(4.5) the price fluctuation would be directly affected by the volume. 
This version is not compatible with our representation (4.3) since the 
price fluctuation is not exactly the sum of the exchanged volume on 
the market. But we keep these things in mind for a future article with 
a different price formulation. 

The model defined by the formulas (4.2) and (4.3) corresponds to a uni- 
variate stochastic process. If one tries to incorporate some dependence be- 
tween the assets prices, and to have for example the Epps effect, and/or the 
lead-lag effect, one a priori needs to extend the model to a a multivariate 
form. The question is where we have to assume z/y ^ and hi ^ be- 
tween the two assets? Of course we could introduce dependency between 
each component, but it is well known fact in financial modeling that a very 
complicated model will be very difficult to calibrate, instable, serve mainly 
for academics and not for practitioners. Then, we assume only two types 
of interaction: the dependence for upward jumps in ask side in both assets, 
and the same for the downward jumps. Since all coefficient of the branching 
matrix are assumed to be positive or equal to zero, if the two assets are nega- 
tively correlated, we don't have any other choice that to introduce interaction 
between upward jump of i and downward jumps of j. As in the univariate 
case, we don't assume any interaction between upward and downward jumps 
for the same asset so we delete one sign for the sake of simplicity. 

Let us mathematically express the multivariate model. Writing \ * + (t) 
for the intensity of the asset i = 1, • • • , d in the upward jump case and Xj{t) 
as in (4.2), asset i is characterized by, 
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(4.6) 



(i), 



Hence, first part of any intensity correspond to the univariate case, second 
one is interaction in the case of positive dependance between assets and third 
one is the negative dependence between assets. For exemple, conditional 
intensity A a + is drawn by it's self exciting part, interaction with the upward 
jumps on best bid price and, interaction with upward and downward jumps 
on best ask prices of other assets. Of course, if the model is quite relevant, if 
asset i is EUR/USD and j is EUR/GBP, or USD/JPY and GBP/JPY (JPY 
= Japonese Yen), the branching coefficient for upward jumps in the best 
ask price should be positive while it should be close to zero for downward 
jumps like these two assets seem to fluctuate more or less together the last 
year. Conversaly, we should have a coefficient close to zero for upward jumps 
between the EUR/USD and the USD/CHF (CHF =Swiss franc) and positive 
for downward jumps. To be clear, we present in Table (1), the interactions 
that we take into account: 
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Table 1: V if true, '-' if false, i ^ j — 1, ■ ■ • , d. Notice is nothing else than 
the branching matrix v. As we can see, in the case of two assets, we have for 
each quantity, four components, it self to produce the auto-excitation effect 
and two other, the corresponding quantities in the other asset, and bid with 
similar jump on the two assets. Up left and bottom right quart correspond 
to monovariate case i and j. 



5 Estimation Procedures 

Let us explain our estimation technique. There are several approaches to 
estimate the parameters in our model. To simulate a point process, we can 
apply thinning algorithm in [DaVe03], [Li09] or [Og81]. Of course, before 
applying one of them, we need to estimate the parameters of the model. The 
most natural estimation is given by the maximum of likelihood method . Let 

6 be the parameters set which depend on mark distribution f = (/i, • • • , fd)- 
also consider the impact function g = (<?i, ■ ■ ■ ,gd), the decay kernel h = 
(hi, ■■• , h d ), the branching matrix, v = {^}ij=i,... ,d an d A* = (yUi, • • • ,fJ>d)- 
Let I = [T_, T + ] be interval containing all arrival times. With these notation, 
the likelihood function is given by: 



L{{t u Vi}]Q) = T[ / XjfavtyltyNjidt x dv)exp(-A J (T + )), (5.1) 

• =1 JlxR 

where Aj(T) is the compensator, or integrated intensity given by 

Aj(T) = f \ 5 (t,v\Ft)dt x dv, j E {1, • • • , d}. (5.2) 

J —oo 

The discrete version of log-likelihood is then expressed as follows: 
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d N d 

log L{{U, v,}; 0) = E lo S A ^ tfe > ^ " E A ^ T+ )' ( 5 - 3 ) 
i=i fc=i j=i 

The maximization of log-likelihood could be done using a classical optimiza- 
tion algorithm. Notice that Bacry et al. [Balla] proposed an alternative 
approach to estimate the parameters: the first find empirically the parame- 
ter set and the one minimizes the mean square error between the empirical 
signature plot and its estimate. 

As we have said before, mark impact functions has to be normalized, and 
in order to do this we have to know their distribution function. To perform 
our analysis, we use two parities, EUR/USD and EUR/GBP recorded in 
milliseconds from January 30, 2012 00:00:00 to March 09, 2012 21:59:00, one 
month ans one week. During this period, we have recorded 3 352 809 trades in 
EUR/USD and 2 178 009 for EUR/GBP. As the number of data is too large, 
we cut the historic week and averaged each result, then, we have 5 week for 
each parity. Data were cleaned using classical procedure, see [DaOl]. Hence, 
we plot cumulative distribution F(V > x) of ask volume which has produces 
an upward and downward jump separately at corresponding time, idem for 
bid volume. Results are given on figure (4). In order to find the mark dis- 
tribution and therefore to compute likelihood function, two fits are proposed. 

First fit are classically obtained with Gaussian law, fc{. x ) — ~^p^ ex P( ~^^ \ 
where fi is the mean and a the standard deviation. The second one is an 
exponential law, cx exp(— (3x), (3 > 0. 

Not surprisingly, the Gaussian fit gives the worst fitting, tails are not 
described correctly, they vanish too quickly. The exponential law provides a 
very good fit of the bid and ask volume distribution. What is very interesting 
to see on this plot, on EUR/USD as on EUR/GBP, is that the cumulative 
distribution for ask volume when occurs an upward jump was very close to 
bid volume when occurs a downward jump. We have the same characteristic 
for the ask volume when occurs a downward jump and for the bid volume 
when occurs an upward jump. There are two kinds of orders that move the 
price, namely the market orders who take all available shares and limited 
orders between the quotes. 
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When a limited order to buy some shares is passed inside the quotes, it 
increases the bid price (upward jump on bid side). Reciprocally when an 
limited order appears bellow the ask price, it decreases ask price (downward 
jump on ask side). 

When a buy marked order take all available shares, ask price increase 
to the next limit available in the limited order book, (upward jump on ask 
side). Reciprocally, when a sell market order take all available shares of the 
first limit, bid price below to the next limit available on the order book 
(downward jump on bid side). 

We clearly see on figure (4) the two behaviors distinctly. Quantity offer 
by a new limited order inside the quotes was greater than the quantity offered 
by an 'old' limited order which becomes the new bid or ask price. 




Figure 4: Cumulative distribution in log scale of ask volume in a case of 
upward jump (blue circle), ask volume in a case of downward jump (blue 
cross), bid volume in a case of upward jump (red circle), bid volume in 
a case of downward jump (red cross) on EUR/USD (left) and EUR/GBP 
(right), from January 30, 2012 00:00:00 to March 09, 2012 21:59:00. Dashed 
line, gaussian fit, solid line, exponential fit. 
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Hence, we will choose exponential law /3exp(— (3x), > 0. A suitable 
impact function is the power law, x a , take into account normalizing condition 
we have: 



at time t + 8t, the impact function quantifies the impact of the volume on 
the trade frequency. Thus, knowing empirical result, e.g. [LiFaMa03], we 
see that the exchanged shares with respect to price fluctuation seem envolve 
according a power law. Like this the instant of trade who produce price 
fluctuation, we choose a power law impact function. As we have explained 
before, market impact is the shifting induce by a trade. Simple example to 
price impact is to want to buy x shares of an asset at market price, but, if 
available volume at market price is strictly less than x, say x m , agent will 
buy x m shares at market price and x — x m at the next limit, and so one if 
the available share at next limit is strictly less than x — x m . Thus, a natural 
problem is trading strategy for liquidition of large volume, we can cite for 
example the seminal work of R. Almgren, see e.g. [A109], where the impact 
function is a power law. 

We still have an important number of parameters to estimate. We have 
sufficient information to perform maximization of (5.3) with classical opti- 
mization algorithm. Looking in detail theory of marked Hawkes process give 
us some way to simplify calibration. 

The decay kernel hi(t — s) gives us the probability that an event of type i 
will trigger an event of type j at time t. In other words, it is the distribution 
of waiting time t — s between the impact of event % to the system at time s 
and the occurrence of an event j at time t. Brunching parameters Uij is the 
average number of event j triggered by event of type j. This two quantities 
can be estimates with usual statistical techniques. 

Finally, we do not have assume any restriction on the value of the param- 
eters. As we have say before, we admit an interaction between upward jumps 
on best bid and best ask price, and similarly for downward jump. Since the 
price process is of course drawn by a certain probability, it is possible to 
observe after the simulations that the ask and bid prices diverge. Hence, to 
avoid this problem we assume that instantaneous intensity for upward jumps 




(5.4) 



Since the conditional intensity is the probability to observe a new event 
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on the ask and the instantaneous intensity for upward jumps are equals, 
fJ>a+ = /4+- The same assumption is considered for the downward jumps 

(0 W 
Ma- = Mb-- 

After the estimation procedure for the parities, EUR/USD and EUR/GBP, 
we have found a spectral radius between 0.71 and 0.84 for each week and its 
averages is 0.81. Hence, the Hawkes process is well defined. 

We now present the conditional marked intensities realization correspond- 
ing to our ask and bid model on EUR/USD on figure (5) for ask prices and 
(6) for bid prices. For a plot more comprehensive, we have plotted 2000 
dots in 'deci'-seconds, then, a total of 3 minutes and 23 seconds, form 30-01- 
2012, 00:12:53 to 00:16:32. The procedure using data on deci-seconds is the 
classical interpolation formula known as the tick estimator: 

t = arg max{tj < t},i = 1,2, ■■ ■ (5.5) 

on the regular grid A t = {Osec.10, Osec.20, ■ ■ • } (see [DaOl] for more details). 
As we can see on the plot, it is possible to observe variation of the intensities 
while the corresponding quantities do not vary. In fact, since we have as- 
sumed interactions between upward jumps on ask price and upward jump on 
bid price, if the bid price increase, the conditional intensity A a+ will increase 
whereas ask price don't have increase. 



6 Stylized Fact 

Our study is focused on Forex markets, and in this situation there is not a 
"last price", there is only bid and ask price. The Forex market is what is usu- 
ally called an "OTC" (Over the Counter) market. In contrast to other mar- 
kets, such as options or futures markets, there is no central reporting facility 
for OTC markets (in particular for Forex). The Forex market is international 
in the sense that each market place takes the relay of the other, trades are 
continuously recorded, they are no closing hours such as in the stock mar- 
ket, hence parities evolve without interruption throughout the whole day. It 
opens with the Australian (Sydney) market at 20:15 GMT on Sunday and 
it closes with the US (New- York) market at 22:00 GMT Friday. Hence there 
is no official "tape". The last traded price for Forex entirely depends on 
where you look. Bloomberg, Reuters, Yahoo, Google, IB, etc. have each one 
different combinations of pools of liquidity from which they are gleaning this 
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Figure 5: Ask volume in gray, blue (red) circle correspond to upward (down- 
ward) jumps corresponding conditional marked intensities are plotted in line 
blue and red. Black dots are ask prices. 




Figure 6: Bid volume in gray, blue (red) circle correspond to upward (down- 
ward) jumps corresponding conditional marked intensities are plotted in line 
blue and red. Black dots are bid prices. 
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Figure 7: Blue: best ask price simulation, red: best bid price simulation. 
Inset: corresponding bid-ask spread. 

information. Among the major interbank market, we have Deutsche Bank, 
City and Barclays Investissement Bank. 

If one looks at the financial time series with sufficiently low frequency, 
for example in daily or weekly scaling, even little higher if the considered 
asset is very liquid, for example hourly, we observe a continuous process. 
For higher frequency, the continuous time hypothesis is clearly rejected. The 
price of an asset, currency, stock, commodity and so on, is the consequence 
of a the transactions which have been made. If all agents buy an asset, the 
price will tend to increase, inversely, if all agents sell an asset, the price will 
tend to decrease. It is the interaction between agents which creates the price 
fluctuation that we observe. This processes is not continuous in time, this is 
not necessarily at each milli-second that an agent finds a counterpart to buy 
(resp. sell) what he wants to sell (resp. buy), hence, the price stops moving 
until deal happen. That's why fluctuation has to be model as a point process. 
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As told before, there are some characteristics of very high frequency finan- 
cial data. For each study of these quantities, we will compare the empirical 
result, the best fit obtained with an appropriate law and the realization given 
by the model. 

Let X(t) = logp(t), the log price of an asset. The signature plot, intro- 
duced in [AnOO], corresponds to the volatility evolution on [0, T] with respect 
to frequency, 

1 [TM 

^x(r) = - (X((n + - X{nr)f (6.1) 

n=0 

so more the lag r is large, more this quantity is decreasing in power law. The 
explanation of that signature is quite simple: at very high frequency, data 
arrives at more or less distant time, asset prices are discontinuous and show 
jump, hence, volatility tend to increase with the frequency because of these 
jumps. Conversely, the fluctuations observed at large scale tend to smooth 
evolutions, hence, volatility tends to decrease until their equilibrium value. 
This effect is plotted on (8) using mid price quote by averaging week by week. 



s _ 



s _ 



o _ 




Figure 8: Signature plot of EUR/USD in black and in red for the simulation, 
fit in power law (green), r varying for to 100 seconds. 

The Epps effect, introduced in [Ep79], is the evolution of the correlation 
between two financial assets with respect to the frequency. As frequency 
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increases, the correlation tends to vanish. This not very complicated to 
realize, if the orders arrive at discontinuous time. On the other hand this 
is not easy to observe if the orders, say sell orders, of the two considered 
financial assets arrive at exactly the same times and then, we observe a 
correlation, although if we are looking for two assets very liquid, it will be 
always a lag and the correlation tends to vanish at very high frequency. 

PiAV = , = (6-2) 

V V *i( r ) V * 2 ( T ) 

where Vjy(r) is define like in (6.1) and the empirical covariation Co is given 
by, 



, \T/r\ 

Co 1)2 (r) = - (M(n + l)r) - X l {nr)) (X 2 ((n + l)r) - X 2 (nr)) . 

n=0 

This effect is plotted on (9) using mid price quote by averaging week by 
week. 




Figure 9: Epps effect on the two considered currency foreign exchange data, 
EUR/USD and EUR/GBP, in black and in red for the simulation, corre- 
sponding fit in power law, r varying for to 100 seconds. 
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7 Forecast and Application to Risk Manage- 
ment 



Even if we have performed our analysis on the particular case of the Forex 
market, we can easily apply our model to various types of financial assets. 
The analysis of the bid/ask price fluctuations gives us the spread, and then, 
an important fees component. We therefore can deduce a very interesting 
stress test type for high frequency trading strategies. 

A well-known fact to practitioners is that, sometimes it is not very com- 
plicated to find some "rules" in order to deduce the next market move, but if 
one includes the broker fees, market impact and bid-ask spread, the detected 
gains probabilities will reduce to zero, and there is a cost to be paid. 

Let us explain in details these kinds of transaction costs. The broker fees 
depend mainly on the broker and correspond in general to a percentage of 
effective value held. 

The market impact is the price to be paid to pass an order. Since the 
available volume on each limit is not infinite, if the amount of the desired 
volume is strictly bigger than the available volume at the market price, the 
agent will take all shares at the market price and then, he will take shares 
to the next limit and so one, until he/she bought (sold) the desired quantity. 
Hence, his transaction is equal to V = J2k z (P a x *Ptick) where p t ick is the 
tick size, Xi measures the distance between the best price and the next limit 
with x\ = 0, Xi G N* and k = Yli=i ^ * s the quantity that he/she has buy 
at limit p a + a^ptick- Finally, his fees due to the market impact is equal to 

Consequently, the bid-ask spread is then the price to pay to an impa- 
tient trader who passes only orders at the market prices. Since there is no 
a unique price on the market but a price for buy (ask) and a price for sell 
(bid), a transaction consisting to buy at market price k shares and closes its 
position at time t + 5t at the market price, will have a cost of ks(t + 5t), with 
s denoting the spread. 

To test high frequency trading strategies, incorporating the spread in the 
model is thus fundamental. The spread is directly deduced as the difference 
of the two simulations of the bid and ask prices. To incorporate the market 
impact, it is quite more difficult. We need to know the next limit and the 
corresponding volume. Without loss of generality, we could argue that the 
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next price after the best ask price, p a , is equal to p a +p t ick and so one. Simi- 
larly for the bid price. This hypothesis is completely possible in a sufficiently 
liquid market, and using the same notation as before, the market impact 
cost will then be equal to J2i=i h%iPtick = Y17=i kiVtick- So the knowledge of 
the best quotes allows to deduce the next limits. But, we do not know the 
available share on each limit. For the simulation we have simply drawn the 
volume process with an exponential law. Of course, statistical properties of 
the exchanged volume are very rich and need an extensive study to build a 
robust model, see e.g. [GoPlGaStOO]. 

Finally, by using the structure of the intensity of the Hawkes process, 
one can deduce a certain forecast strategy. It is shown in [Ve95] that the 
probability of next time event r* that appears after time r is given by the 
formula 

P( r * > r ) = exp ^- jf J X(s, v\F a )dvdsj . (7.1) 

We can find such r that maximizes the probability for all the 4 intensities in 
the model (4.2) or for the d*4 intensities in (4.6). To perform the simulation, 
notice that we have to update at any time t + 5t until t + r history T%. With 
this information, we could deduce what will be the next event and take the 
corresponding position. In contrast to the most part of the literature which 
takes into account only mid or last price, knowing the probability that a 
downward or upward jump will happen on the bid or ask side give us an idea 
on the next price movement, and also the expected spread. This information 
indicates if the trade has a chance or not to be profitable. Of course, to 
deduce the size of the next bid-ask spread, we need to have an idea about 
the next bid ask price value. 

Producing the mark at any time t + 5 until t + r is also needed to perform 
our simulation. The mark value, v(t + 8t) can be generated from the density: 
f( v ) = J R %+fCv\^ t + +st)dv - Addm § the generated pair (t + 8t\ v(t + 6t)) to the 
history permits to perform the exercise and moreover gives us all information 
needed to the next time price movement in a risk management framework: 
the spread and the market impact. 
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8 Conclusion 



In this paper we propose a model that describes the first line of an order 
book, which is represented by the best bid and best ask prices. The main 
line of our work is to say that the arrival of the orders does not depend only 
on the past history, but also on the volume. To this end, the idea is to use a 
marked point process instead of an unmarked one. Small quantity of shares 
does not affect the market in the same as huge quantities do, and then, the 
arrivals intensities changes. These two quantities are closely linked. 

We have briefly discussed about the compound case, see (4.5). We empha- 
size that it could be interesting to study further this representation. Finally, 
it seems possible to extend our model to an important part of modern finance 
theory such as computing Value at Risk or option pricing via a Monte Carlo 
scheme for very high frequency data. 
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